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BACILLUS THORINGIENSTS CPYfiTar, 
PROTEIN GENE TOXIN SEGMENT 
Field of fche ImwnH ftn 
This invention relates generally to DNA 
5 sequences capable of being expressed in genetically 
engineered host organisms. More particularly, this 
invention relates to an expressible DNA fragment coding 
for the toxin portion of Bacillus thuringt«fiiH« crystal 
protein. 

10 Background of the Invents nn 

As is well known, Bacillus thurinqiAng* g 
crystal protein is toxic to the larvae of a number of 
lepidopteran insects* As a result preparations 
containing Bacillus thuringiensis crystals are used 

15 commercially as a highly selective biological 
insecticide. Unfortunately, relatively high 
manufacturing costs and problems connected with the use 
of the crystals have made it difficult for such 
insecticides to compete effectively with other 

20 commercially available products. 

Wild-type BacillUS thuringiengis produce 
crystal protein only during sporulation. Such a growth 
phase limitation, particularly in an industrial 
process, can result in inconvenience and excessive time 

25 requirements during manufacture. This of course 
increases the costs of the final crystal protein 
insecticide product. 

To overcome the growth phase limitations of 
wild-type Bacillus fchuringigng* a. U.S. Patents 

30 4,448,885 and 4,467,036, issued Hay 15, 1984 and August 
21, 1984, respectively, to Schnepf and Whiteley, 
disclose expression of Bacillus thuringiensis crystal 
protein by use of novel recombinant plasmids containing 
expressible heterologous DNA coding for crystal protein 

35 protoxin polypeptide. The Schnepf /Whiteley patents 
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also disclose that genetically engineered bacterial 
host strains, transformed by the novel recombinant 
plasmids, express Baetiias rhnringiensis. crystal 
protein protoxin polypeptide. Such genetically 
engineered bacterial host strains express Bac i l l us 
»h»r< n? unats crystal protein protoxin polypeptide, at 

all stages of growth. 

It is now known that in the Bacillus 
f h'T' 4 "^ »"s*s subspecies that synthesize lepidopteran 
toxins, the crystal protein crystal is composed of one 
or more protoxin polypeptides of Mr«135,000 to 160,000 
(Calabrese, et al., 1980; Tyrell, et al., 1981). Upon 
dissolution and proteolytic degradation of crystals ia 
vitro and presumably also in the insect midgut, each 
15 protoxin molecule yields a toxin peptide of 

Mr»55, 000-72, 000 (Bulla, et al., 1979; Lilley, et al., 
1980; Chestukhina, et al., 1982). 

Since the crystal protein protoxin 
polypeptides are approximately twice the size of the 
20 toxic fragments, insecticidal preparations containing 
the protoxin crystals could be made to be twice as 
effective per given dose (or equally as effective at 
half the dose) if they contained the smaller toxin 
fragment rather than the larger protoxin polypeptide. 
25 In addition, if genetically engineered host strains 
could be transformed to express the toxin fragment 
rather than the protoxin polypeptide, the output of the 
host could be increased and possibly even doubled. To 
that end it would be useful to identify the specific 
30 segment of a **r.i i lua fhnrinqtensia crystal protein 

gene that codes for the toxin fragment. It would also 
be useful to demonstrate that such a DNA segment is 
expressible in host organisms and that the resultant 
protein product is toxic to lepidopteran insects. 



Objects 

It is an object of the present invention to 
identify the segment of a Bacillus thuringignsis 
crystal protein gene, referred to herein as the 
"toxin-encoding" segment, which codes for the portion 
of a crystal protein that is toxic to lepidopteran 
insects. 

It is a further object of the present 
invention to determine the DNA sequence of the 
"toxin-encoding" segment of a Bacillus thuringiensis 
crystal protein gene. 

It is a further object of the present 
invention to demonstrate that a Bacillus thuringiensis 
crystal protein "toxin-encoding" gene segment is 
expressible in transformed recombinant host organisms. 

It is a further object of the present 
invention to demonstrate that the protein product 
produced by recombinant hosts transformed to express a 
Baeiilus thuringiensis crystal protein "toxin-encoding" 
gene segment is toxic to lepidopteran insects. 

Other objects of the present invention will 
become apparent to those skilled in the art from the 
following description and figures. 

FIGURES 

General D Pgcrlpfcion of the Figures 
FIGURE 1 (views A and B) shows restriction 

enzyme maps and sequencing strategy for a Bacillus 

frhtuHngiensis crystal protein gene; 

FIGURE 2 (views A and B) shows the DNA 

sequence of a Bacillus thuringiensis crystal protein 

gene; 

FIGURE 3 (views A and B) shows restriction 
maps of pESl and construction of pHES16; 

FIGURE 4 (views A-E) shows the construction 
strategy for the 3* -end deletions of a crystal protein 
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gene in pHESl6; 

FIGURE 5 (views A-C) shows construction of 
plasmids fusing a crystal protein gene to the lac 
promoter ; 

5 FIGURE 6 (views A-C) shows the 3*-end 

deletions of a crystal protein gene; and 

FIGURE 7 (views A-C) shows transcriptional and 

translational fusions of lacZ to the crystal protein 

gene and to crystal protein genes modified at the 3 1 

10 and/or 5 1 ends. 

PffffljlA* Descn' pi-ion of the Figures 

FIGURE 1A is a restriction enzyme map of pESl 
showing the portion of the plasmid which was sequenced 
(boxed segment) r the length and direction of the 
15 crystal protein transcript (arrow) r and the portion of 
a subcloned restriction fragment which was used as an 
SI nuclease mapping probe (bracketed line). The 
thicker lines represent vector (pBR322) sequences. 

FIGURE IB is a restriction enzyme map and 
20 sequencing strategy for ca» 2800 bp of the crystal 
protein gene. The sequence of the first 1500 bp has 
been reported (Wong, et al., 1983). Arrows indicate 
the length and direction of sequence determination from 
the sites presented. 
25 FIGURES 2A and 2B show the UNA sequence of the 

crystal protein gene, including some 5 1 and 3' flanking 
sequences. The start sites of transcription in 
Bacillus fhniHnqignsis (Btl and Btll) and in 
Rgghgrichia coll (Ec) are indicated as well as the 
30 entire deduced amino acid sequence. FIGURE 2A includes 
nucleotides 1 through 2200 f plus corresponding codons 1 
through 558. FIGURE 2B includes nucleotides 2201 
through 4222 f plus codons 559 through 1176. 
Nucleotides 4053 through 4055 comprise codon 1176 which 
35 codes for the carboxy terminal amino acid in the 
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Bacillus fchuringienais crystal protein* Nucleotides 
4140 through 4185 comprise the transcriptional 
terminator for this gene* 

FIGURE 3A shows restriction enzyme site maps 
5 of pESl linearized at the single Sai l site* A partial 
map is shown for Xmni (solid line) and complete maps 
are shown for fiindlll, £aiil, and Hdal. The location 
of the crystal protein gene is shown with a dashed box, 
and an arrow indicates the direction of transcription. 

10 FIGURE 3B is a restriction enzyme map of pESl 

which shows the location of Tn5 insertion B8 
(stem-and-loop structure) , the position of the crystal 
protein gene (thickened line) , the direction of 
transcription (arrow) , and the position of pBR322 

15 vector sequences (boxed lines) • The Xhal-Hindlll 

fragment used to construct pHES16 (cross-hatched line) 
and its position in that plasmid are indicated. 

FIGURE 4A is an expanded restriction enzyme 
map of a segment of pESl containing the crystal protein 

20 9 ene (thickened line) and 3 1 flanking sequences. The 
positions of Tn5 insertions in the flindlll E fragment 
are indicated with arrows. 

FIGURE 4B shows the strategy for obtaining 
3' -proximal deletions in the flindlll E fragment of pESl 

25 from bacteriophage H13mp8 containing the ££iiH C 
fragment of pESl cloned into the Smal site in the 
indicated orientation. 

FIGURE 4C shows the strategy for constructing 
deletions in pESl containing Tn5 insertions in the 

30 flindlll E fragment. 

FIGURE 4D illustrates the method used to 
extend the crystal protein gene segment in pHESlS to 
deletion endpoints in the flindlll E fragment of pESl. 
The plasmids derived from the M13 phages or the Tn5 

35 insertion mutants are indicated on the connecting 
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arrows. (pHES38 f indicated parenthetically f was made 
in the manner indicated from a phage containing Xmnl 
fragment 5 (FIGURE 3A) and required no further 
deletion. 

5 FIGURE 4E illustrates the strategy used to 

make deletion derivatives (pHES37, 39, 40) of pHES35. 
(See FIGURE 4D for the construction of pEHS35.) 

FIGURE 5 A shows transcriptional fusion of the 
lac promoter to the crystal protein gene. A map of 

10 pESl linearized by Sail is presented, showing the 
location (thickened line) and direction of 
transcription (arrow) of the crystal protein gene. 
Vector sequences are boxed. The location and direction 
of transcription of the lac promoter are indicated with 

15 a £ and an arrow. The location of the lac 

alpha-complementation segment is indicated by the 
symbol for the Greek letter alpha. 

FIGURE 5B shows translational fusion of the 
lSCZ gene to the 10th and 50th ccdons of the crystal 

20 protein gene* A map of pESl indicating the position 
and orientation of the crystal protein gene is 
presented. The Xmn i partial/Xhal complete digest 
fragments used in pHES 44 and 45 are also indicated by 
lines. A detailed description of these constructions 

25 is found in the Materials and Methods section, ammu 
FIGURE 5C shows translational fusions of the 
lac z gene to the 10th and 50th codons of a truncated 
Tn5 insertion mutation of the crystal protein gene. 
Maps of pESl-B22 and pESl-B22-l, its Xfcal deletion 

30 derivative, are shown. The Xmni partial/Xhal complete 
digestion fragments used to make pHES4S and 47 are 
indicated with lines beneath the maps. 

FIGURE 6A is a restriction enzyme map of pESl 
with an expanded map of the crystal protein gene 

35 region. The crystal protein gene (thickened line) , its 
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direction of transcription (arrow) and pBR322 vector 
sequences (boxed lines) are indicated. The portion of 
the crystal protein gene remaining in the indicated 
plasmids is shown by a line under the restriction 
5 enzyme maps. Boxed segments of these lines indicate 
Tn5 sequences. The results of toxicity test for 
Escherichia £ali containing these plasmids are also 
shown • 

FIGURE 6B shows immunoblots of extracts of 

10 Escherichia coll- carrying the following plasmids: 

lanes 1, p£Sl-B8; 2, pH£S19; 3, pHBS32; 4, pHES23; 5 r 
PHES25; 6, pBES33; 7, pBES30; 8, pBESIS; 9 pBES31 
(containing the Hindi I I insert of pBES32 in reversed 
orientation) ; 10 f pBES322* Lane 1 contains 1 

15 microliter of extract; lanes 2-10 contain 20 
microliters of extract. 

FIGURE 6C shows immunoblots of extracts of 
Escherichia soli carrying the following plasmids: 
lanes 1, pESl-B8; 2, pHES34; 3, pHES35; 4, pHES36; 5 f 

20 pBES37; 6, pHES38; 1, pHES39; 3, pBES40. Lane 1 

contains 1 microliter of extract; lanes 2-8 contain 20 
microliters of extract. 

FIGURE 7A is a restriction enzyme map of pESl 
with an expanded map of the crystal protein gene 

25 region. The crystal protein gene (thickened line), its 
direction of transcription (arrow) and pBR322 vector 
sequences (boxed lines) are indicated. Lines below the 
maps showing the amount of crystal protein gene 
remaining in the indicated plasmids are shown under the 

30 restriction enzyme maps. For the deleted plasmids f 

boxes at the left end indicate translational fusions to 
lacZ; boxes at the right end indicate Tn5 sequences. 
Dashed thickened lines at the right end indicate lac. 
alpha-peptide sequences and the non-dashed thickened 

35 lines at the right end indicate the final 74 codons and 



transcriptional terminator of the crystal protein gene. 
The results Of toxicity tests for Escherichia coli 
containing these plasmids are also shown. 

FIGURE 7B shows immunoblots of Escherichia 
ctrli carrying the following plasmids: Lanes l r pHES41; 
2, pHES44; 3, pHES45; 4, pHES43; 5 r pHES46; 6, pHES47. 
Lanes 1-3 contained 5 microliters of extract; lanes 4-6 
contained 10 microliters of extract. 

FIGURE 7C shows immunoblots of Escherichia 
cart carrying the following plasmids t Lanes 1, pHES43; 
2, pHES50; 3, pHES48; 5, pHES49; 6, pflES52; 7 pBRS322. 
Each lane contained 20 microliters of extract. 
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Definitions 

In the present description and claims 
reference will be made to terms and phrases which are 
expressly defined for use herein as follows: 

As used herein, the letters A, T, C, G are 
meant to denote the nucleotides adenine, thymine, 
cytosine and guanine in DNA, respectively. 

As used herein, bp means base pairs. 

As used herein, substantial sequence homology 
is meant to denote nucleotide sequences that are 
substantially functionally equivalent to one another. 
Nucleotide differences between such sequences having 
substantial sequence homology will be dfi minimus in 
affecting the function of the gene products coded for 
by such sequences. 

As used herein, amino acid abbreviations are: 



Phenylalanine Phe Bistidine Bis 

Leucine Leu Glutamine Gin 

Isoleucine lie Asparaglne Asn 

Methionine Met Lysine Lys 

Valine Val Aspartic acid Asp 

Serine Ser Glutamic acid Glu 
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Prollne Pro Cysteine Cys 

Threonine Thr Tryptophan Try 

Alanine Ala Arginlne Arg 

Tyrosine Tyr Glycine Gly 



5 As used herein, crystal protein gene means a 

DNA segment that codes for a ggcttlttg tftur ingi'ensis 
crystal protein peptide that is toxic to lepidopteran 
insects. The term "protoxin" means a crystal protein 
polypeptide of ca. M r =»130,000 - 160,000 which upon 

10 dissolution and proteolytic degradation (in vitro and 
presumably also in the insect midgut) yields a smaller 
■toxin" fragment of ca. M r 55,000-73,000 that is itself 
toxic to lepidopteran insects. The phrase "amino 
terminal 55% of the Bacillus Hmringiensig crystal 

15 protein gene" means the amino terminal 645 codons of 

the crystal protein gene as shown in FIGURES 2 A and 2B. 

As used herein, transcriptional terminator 
means that DNA sequence which promotes cessation of 

20 transcription. When the term is used to describe the 
transcriptional terminator sequence for the Bacillus 
frhtTrirrgtgngjg crystal protein gene shown in FIGURES 2A 
and 2B, the term is meant to encompass the sequence 
comprised of base pairs 4140 through 4185. (See FIGURE 

25 2B) . 

As used herein, the phrase "final 74 codons" 
means the final 74 codons of the Bacillus thuringiensis 
crystal protein gene as shown in FIGURES 2 A and 2B. 
More specifically $ the final 74 codons are comprised of 

30 codons 1103 through 1176 (or base pairs 3833 through 
4054) as shown in FIGURE 2B. 

The methods of the present invention make use 
of techniques of genetic engineering and molecular 
cloning. As used herein, genetic engineering means 

35 techniques which lead to the formation of new 
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combinations of heritable material by the insertion of 
nucleic acid molecules , produced or derived by whatever 
means outside the cell f into a bacterial plasmid or 
other vector system so as to allow their incorporation 
5 into a host organism in which they do not naturally 
occur but in which they are capable of replication. 
Host organisms carrying these new combinations of 
heritable material are referred to herein as 
recombinant host organisms. General techniques of 

10 genetic engineering and molecular cloning are included 
in ManiatiS f et al., Molgcaiar Clonlnqr? A -Laboratory 
Msnua* . Cold Spring Harbor Laboratory (1982) . 

Description of the Invention 
Snmmary of the • Invention 

15 The present invention comprises a DNA fragment 

that codes for the portion of Bacillus thurinoi-ensis 
crystal protein peptide that is toxic to lepidopteran 
insects. The invention also comprises the DNA and 
amino acid sequences for the disclosed toxin-encoding 

20 DNA fragment. In addition the invention demonstrates 
that the disclosed toxin-encoding DNA fragment 
(referred to herein as the Bacillus thuringiensts 
crystal protein gene toxin segment) is expressible in 
recombinant host organisms , and that the * toxin" 

25 protein product produced by the transformed hosts is 
toxic to lepidopteran insects. 

Detailed Description of the Invention 
It is known that many subspecies of Bacillus 
fchnringi-gnsts produce crystal proteins that are toxic 

30 to lepidopteran insects* Is is also known that these 
crystal proteins are coded for by crystal protein genes 
(Schnepf and Whiteley, 1981; Kronstad, et al. , 1983; 
Held, et al., 1982) which, depending on the subspecies, 
may be located on large plasmids and/or the chromosome 

35 (Kronstad f et al., 1983; Held, et al., 1982; Klier, et 
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al.r 1982; Schnepf and Whiteley, 1981). 

We have cloned a crystal protein gene from 
BarH-Mug fiyir<nfl<»ns<fl subspecies Kurstaki HD-l-Dipel 
and have shown that the gene is located on a large 
5 plasmid in this strain. Sea U.S. Patent 4 r 467 r 036. In. 
addition we have shown that the crystal protein gene 
from subspecies irnrataici- HD-l-Dipel is homologous to 
crystal protein genes on one or more plasmids (and in 
one case the chromosome of) several Bacillus 
10 »hnrfn«j4 Ana ta strains. Sss. U.S. Patent 4,467,036; also 
ass Kronstad, et al. (1983). 

In a previous communication the 
transcriptional and translational start sites and the 
nucleotide sequence for approximately one-fourth of the 
15 gene cloned from n»eti \ua thuringiensia subspecies 

Kwrsfcalci- HD-l-Dipel were reported (Wong, et al. , 1983). 
However , since more than this partial sequence was 
required to determine which portion of the crystal 
protein gene coded for the "toxin" portion of the 
20 crystal protein, we determined the DNA sequence for the 
remainder, of the gene. 

Our sequencing strategy is shown in FIGURE 1; 
also ass the Materials and Methods section, siipra. The 
DNA sequence is shown in FIGURES 2A and 2B as is the 
25 amino acid sequence deduced from the only open reading 
frame in the DNA sequence. This sequence contains the 
NH2-terminal sequence determined previously (Wong, et 
al.f 1983) by chemical methods. 

The disclosed DNA sequence codes for 1176 
30 amino acids specifying a polypeptide with a calculated 
molecular weight of 133,500. As discussed in Example 
I, this value for the crystal protein protoxin peptide 
agrees well with molecular weights determined for the 
protoxins from nneniua ihnrf ngiensis subsp. Kurstak i 
35 and subsp. Mmr*ng««n«i-g. In addition the deduced 
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amino acid composition is very similar to the 
chemically determined amino acid compositions of either 
whole crystals or the purified protoxin of Bacillus 
fchirringtensig subsp. kurstafrj; (Sfifi Table If JSJIStt.) 

5 Knowing that the Bacillus ttturingiensis 

crystal protein protoxin can be cleaved to yield a 
smaller toxin fragment, we created recombinant 
Escherichia colt strains bearing deletions and fusions 
of the crystal protein gene and then analyzed the 

10 proteins they produced to delineate the portion of the 
gene which encodes the toxin peptide. Construction of 
these "deletion and fusion" plasmids is discussed in 
the Materials and Methods section, suBta; also ££& 
FIGURES 3-7. The truncated peptides produced by 

15 recombinant hosts transformed by these plasmids are 
discussed in Examples II through V. 

Very generally, the truncated peptides 
produced by host strains transformed by the "deletion 
and fusion" plasmids indicate that the amino terminal 

20 55% of the crystal protein gene encodes sufficient 
information to produce a lepidopteran toxin. More 
specifically r the "deletion and fusion" results 
indicate that deletions to the 50th codon from the 5 ! 
end of the gene, or to the 603rd codon from the 3 1 end 

25 abolish toxicity, while deletions to the 10th codon 

from the 5" end, or to the 645th codon from the 3' end 
do not. They also indicate that the 3* end of the 
crystal protein gene, from codons 645 to 1176 is not 
essential for toxicity, and the first 10 codons can be 

30 replaced by two different segments containing the 

N-terminus of beta-galactosidase and synthetic linker 
sequences without eliminating toxicity. 

The shortest toxic segment of the crystal 
protein gene reported here encodes a polypeptide of ca. 

35 M r 73,000; the outermost deletion endpoints for 
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non-toxic mutants (codons 50 through 603) would encode 
a peptide of ca. M r 63,000* Others have reported toxic 
proteolytic fragments of the crystal protein in the M r 
30,000 to 80,000 range (Huber and Luthy, 1981) with 
5 the most recent reports indicating a size of M r 55 r 000 
to 72,000 (Bulla, et al., 1979; Lilley, et al., 1980 
and Chestukhina, et al., 1982), 

While differences in molecular weight 
standards, electrophoresis systems and Bad Hug 

10 thurtrrgtgrrgf"g strains could account for the differences 
between our current results and those of investigators 
reporting toxic fragments of H r 67,000 to 72,000, it is 
also possible that smaller toxic fragments could come 
from a shorter segment of the gene than that delineated 

15 in the present investigation. We point out that the 
major difference between our study and the preceding 
work is that proteases were used in the earlier studies 
to generate smaller toxic peptides from solubilized 
preparations of crystals whereas we have used 

20 Escfrertcfria gall cells to synthesize toxic proteins 
from altered genes. Assuming that the shorter toxic 
peptides are encoded by the segment of the crystal 
protein gene delineated here, our finding that shorter 
segments of the gene produced non-toxic peptides may be 

25 related to this difference in methodology. It is 

possible that deletions into the minimum toxic segment 
of the gene remove amino acid segments which are either 
necessary for attainment of the toxic conformation, or 
are required to prevent "non-toxin" portions of the 

30 remaining polypeptide segment from blocking attainment 
of the toxic conformation of the protein. 

The results of this study are, however, in 
broad agreement with previous work showing that the 
N-terminus of the crystal protein was present on a 
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toxic polypeptide fragment (Chestukhina, et al., 1982). 
In addition, our experiments involving 3'-end deletions 
or gene fusions indicate that there is a segment of 
increased susceptibility to proteolysis between residue 
5 603 and 645 which leads to the production of an 
N-terminal ca. M r 70,000 fragment in the deleted 
strains. The location of this site near residue 645 is 
inferred from the results obtained with successive 
3'-end deletions. More specifically, a' plasmid in 

10 which crystal protein sequences terminate at codon 645 
directs synthesis of little, if any, polypeptide beyond 
the cleavage site. The presence of a processing site 
before residue 645 is implied by apparent removal of 
the Izg alpha-peptide in extracts of Escherichia coii- 

15 containing a plasmid having the lac alpha-peptide 
sequences fused in phase following codon 645 of the 
crystal protein gene. The lac alpha-peptide is 
functional in this plasmid, since it complements the 
beta-galactosidase activity in Escherichia jm83 or 

20 JM103 (data not shown). 

The different N-terminal amino acids reported^ 
by several investigators for the toxic fragments of the 
crystal protein (Lilley, et al., 1980; Chestukhina, et 
al., 1982) may reflect some flexibility in the 

25 N-termini which can be present on these fragments. 

This is supported by our current study which indicates 
that the first 10 codons of this gene can be replaced 
by two different polypeptide encoding segments without 
eliminating toxicity. 

30 The 5 f -end and 3* -end alterations of the 

crystal protein gene reported herein involved fusing 
these sequences to new sequences, some of which had 
unusual effects on the expression of the altered 
proteins in Escherichia call- . When the entire crystal 

35 protein gene was present, the 5'-end fusions to 
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beta-galactosidase at the 10th and 50th codons of the 
crystal protein gene resulted in the production of a 
ca. M r 110,000 antigen in addition to the H r 134,000 
crystal protein. This may indicate an altered 
5 conformation that is more sensitive to degradation. In 
addition, the 50th codon fusion showed little or no 
processing to a ca. H r 70,000 fragment (and no 
toxicity) when two different 3'-end deletions were 
present. However, some M r 70,000 material was detected 

10 when this altered gene had the final 74 codons of the 
crystal protein gene following codon 645. This implies 
that loss of processing and toxicity may be due to a 
conformational change mediated by the alteration at the 
5 1 end of the gene. The 50th codon fusion disrupts the 

15 most extensive hydrophobic segment (data not shown) of 
the crystal protein. The lack of toxicity of the 
bacteria containing this alteration at the 5' end 
implies that this segment of the protein is either 
directly involved in toxicity, or that the alteration 

20 of this segment prevents the attainment of the toxic 
conformation of the protein. 

In the case of the 10th codon fusion, segments 
of 9 amino acids or 19 amino acids can be substituted 
without eliminating toxicity. Preliminary evidence 

25 from titration experiments indicates that the protein 
having the 9 amino acid-substituted 10th codon fusion 
and the protein containing the natural N-terminus 
differ in toxicity by less than about 3 fold, if they 
differ at all. N-terminal substitutions have been 

30 reported for other proteins as well, most notably 

beta-galactosidase (Fowler and Zabin, 1983), the lac 
alpha-peptide (Vieira and Messing, 1982; Messing and 
Vieira, 1982; Traboni, et al., 1983; see also above) 
and chloramphenicol acetyl transferase (Goldfarb, et 

35 al., 1982). In beta-galactosidase (Fowler and Zabin, 
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1983), increased susceptibility to heat or denaturation 
by urea were reported to be due to N-terminal 
substitutions. 

Fusion of the final 74 codons of the crystal 
5 protein gene (codons 1103-1176 in FIGURE 2B) , a segment 
distal to the toxic portion of the molecule, to codon 
645 also has an apparent effect on conformation. 
Although a cleavage product of ca. M r 70,000 is 
produced, a much more prominent cleavage product of ca. 

10 M r 50,000 is also detected. This may, however, be an 
exceptional case since the final 74 residues of the 
crystal protein might be able to form a domain which > 
can recognize and bind to a site in the amino-terminal 
region of the protein. If this binding were to take 

15 place without the intervening polypeptide sequence, the 
overall conformation of the protein might be altered, 
resulting in an increased susceptibility to 
proteolysis. 

In assessing the effects of different 3 1 ends 

20 on gene expression a surprising result was the 

observation that, given an equivalent promoter-ribosome 
binding site configuration, the source of the sequence 
following codon 645 of the crystal protein gene 
influenced the extent of synthesis of the altered 

25 crystal protein. Since we have been unable to detect 
differences in the stability of these polypeptides, 
this difference in expression may reflect differences 
in the stability of mRNAs, although some unusual effect 
on translation elongation or termination cannot be 

30 ruled out. Largely as a result of the study of the 

regulation of the bacteriophage lambda iat gene, where 
gene expression is regulated, in part, by alternative 
RNA structures near a transcription termination site 
(Court, et al., 1983), it has been proposed that 

35 transcriptional terminators have a more general role in 



regulating gene expression by affecting the stability 
of mRNA (Holmes, et al., 1983). 

The termination site used by the altered 
crystal protein genes which end in pBR322 sequences is 
most likely distal to the beta-lactamase gene, and 
consists of three stem-and-loop structures. The mRNA 
of the beta-lactamase gene has a half-life of ca. 3 
min. (von Gabain, et al., 1983). The site at which 
transcription would terminate when reading into Tn5 is 
not known , however it is known that Tn5 is highly polar 
to transcription (Berg and Berg, 1983) . On the basis 
of the result described here, we raise the possibility 
that there is a transcriptional terminator in the first 
485 bp of Tn5. 

The crystal protein gene terminator has a very 
favorable stem-and-loop structure, delta G==-3Q.4 
Kcal/mole (data not shown) , and appears to allow the 
highest level of expression. Possibly, the stability 
of this structure is related to the reported longer 
half-life of the crystal protein mRNA (Petit-Glatron 
and Rapoport, 1975) and the larger amount of toxic 
peptide detected in strains carrying the terminator at 
the 3' -end. 

Details of these and other specific 
embodiments of the present invention are outlined in 
the following examples. Such examples are for 
illustrative purposes only and are not intended to 
limit the scope of the claims in any way. The 
materials and methods utilized in these examples are 
listed below. 

Mafrgri-gis yrnd Methods 
Bacterial: -strarns/ -piasmids 'and -phages. 
Plasmid pESl (s&fi O.S. Patent 4,467,036 issued August 
21, 1984; also fifift Schnepf and Whiteley, (1981)} in 
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Escherichia colt strain HB101 was the source of the 
crystal protein gene DNA. Ml 3 bacteriophage strains 
mp7 f mp8, and mp9 were propagated on Egcherichiar colt 
strain JH103 (Messing, et al., 1981; Messing and 
5 Vieira f 1982). E^chgrichia £gli strains CS412 (Gray 
and Chang, 1981), JM83 and JM103 (Vieira and Messing, 
1982), plasmids pBR322 (Bolivar, et al., 1977), pUC8Ap 
(Vieira and Messing, 1982), p0C13Cm and phage M13mp8 
(Messing and Vieira, 1982) are described in the 

10 references cited for each. Plasmid and double-stranded 
phage DNAs were prepared as described in Birnboim and 
Doly (1979). Transfection of phage DNA was according 
to Mandel and Higa (1970). The methods used for 
transposon 5 (Tn5) mutagenesis , plus descriptions of 

15 some of the Tn5 insertion mutants in pESl are described 
in Wong, et al. (1983) . The positions of several 
additional Tn5 insertions in pESl are shown in FIGURE 
4; they were determined as described in Wong, et al* 
(1983). 

20 Enzymes -and -Radiolabeled 'Compounds 

Restriction enzymes , T4 DNA ligase, and SI 
nuclease were purchased either from New England Biolabs 
or Bethesda Research Laboratories. The large fragment 
of DNA polymerase I and all 3 2 P-labeled nucleotides 

25 were purchased from New England Nuclear. All enzymes 
were used as recommended by the suppliers. 

DMA Sggtigncfng 
The dideoxynucleotide chain termination 
procedure of Sanger, et al. (1977) employing 

30 M13-derived phages as templates was used for DNA 
sequencing as described in Wong, et al. (1983) . 
Template phages for sequencing the internal £zull 
fragment (bases 1845 to 3831) of the crystal protein 
gene were obtained by the DNAse I deletion method of 

35 Hong (1982). Alternatively, in order to bring the 
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seguencing primer adjacent to several of the S2113A1 
sites in the SmiII fragment, the DNA was partially 
digested with Ssii3Al and completely digested with BamHI 
(the latter enzyme cuts only in the polylinker site of 
5 Ml 3 mp8 adjacent to the sequencing primer site) • 
Less-than-full size phage DNA was then purified and 
^circularized with DNA ligase. 

The DNA sequence was translated and analyzed 
by the use of a computer program (Nicollr 1983) 

10 modified for use on the IBM Personal Computer by Dr. D. 
Nicoll and Dr. J. Champoux (Department of Microbiology 
and Immunologyr University of Washington). 

Preparation and -mani-puiatt on of olasmid -and 
pftagg -DMAs. Standard methods were used for preparing 

15 plasmid and phage DNAs and for digesting and modifying 
these DNAs (Maniatis f et al., 1982). DNAse I was used 
in the presence of Mn ++ (Hong, 1982) to obtain 
deletions in phage or plasmid clones. The £cqRI* 
activity was obtained as described by Polisky, et al., 

20 (1975). 

imrnunobiofcti-ng. Whole cell extracts of 
Escfrgrfrghia colt were prepared as follows: 1 ml of 
culture in L-broth was centrifuged for 30 sec. in a 1.5 
ml Bppendorff tube, the supernatant was removed and the 

25 pellet was resuspended in 30 microliters of 0.01 M 

tris, 0.005 M BDTA pH 7.0. Fifty microliters of hot 2X 
sample buffer (0.1 H tris, pH 6.8, 2% sodium dodecyl 
sulfate, 2% beta-mercaptoethanol, 20% glycerol and 
0.01% bromphenol blue) were added and the tube was 

30 plunged into a boiling water bath for 2 min. One-tenth 
ml of a solution containing 7.2 M urea, 1% sodium 
dodecyl sulfate, and 2% beta-mercaptoethanol was added, 
the preparation was sonicated for 30 sec. and then 
boiled for another 2 min. Ten to forty microliters of 

35 this preparation were applied to a 10% sodium dodecyl 



sulfate-polyacrylamide gel and subjected to 
electrophoresis (Wong, et al., 1983). The proteins in 
the gels were transferred electrophoretically to 
nitrocellulose and were reacted with anti-crystal 
antibody and [ 125 U fiMtyhyToeaeeng aureus Protein A as 
described in Schnepf and Whiteley (1981). 

Trrsggtr-feo ^tfftfry -gsaavs. Hatchling tobacco 
hornworm (Uanduca SfiXta) caterpillars were obtained 
from Drs. J. Truman and L. Riddiford (Department of 
Zoology, University of Washington). Cell extracts were 
prepared by sonicating the cells from a 150 ml 
overnight culture in 2 ml of 0.1 sodium phosphate 
buffer, pH 7.4. Cell suspensions of the recombinant 
ggffhgrf ehta cait strains from 20-50 ml of overnight 
culture were prepared in a 0.25 to 0.3 ml volume. The 
extracts (0.2 ml) or suspensions (50 microliter) were 
spread on an agar-based diet (2-4 ml/vial; 4.8 cm 2 
area; Ssa Schesser, et al.r 1977) and allowed to dry. 
One neonate caterpillar was placed in each vial and 
mortality was determined after 5 days. Each 
recombinant strain was tested two to four times in 
quintuplicate. A positive toxicity score indicted that 
all 5 larvae died within 5 days whereas a negative 
score indicted normal growth and development. 

PTggmta eo nsi-riictH-anB. The source of crystal 
protein DNA was pESl (ATCC 31995) or its Tn5 insertion 
derivatives. The strategy used to delineate the 
toxin-encoding portion of the gene was to construct a 
vector (pHES16) which contained the 5' portion of the 
gene and was not lethal to the larvae of the tobacco 
hornworm. In a second construction, an adjacent 
segment of the crystal protein gene (the fliadlll E 
fragment) was added to extend pHES16 thereby creating a 
sequence (pHES19) which encoded a toxic peptide. To 
determine which portion of the flindlll-E fragment 
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contained the 3 1 terminus of the toxic activity , 
subfragments of atndlll-B were added to extend pHES16. 
To determine the 5' margin of the toxic peptide, the 
IscZ alpha peptide was fused to the crystal protein 
5 gene at the 10th and 50th codons. 

pBBSlfl--a -toxin -anatygjg -ttp^t*. Deletions 
were made in pBR322 to remove the Pvuii and Xmn i sites 
at positions 2065 and 2030 in pBR322 (Sutcliff, 1979) 
by digestion with Bgi 31 nuclease followed by 

10 religation. The Xixsl-JBiadlll fragment of pESl-B8 

containing a portion of Tn5 and the 5 1 portion of the 
crystal protein gene with its transcriptional and 
translational start signals (crosshatched bar in FIGURE 
3B) was purified by agarose gel electrophoresis and 

15 DE52 chromatography. This fragment was cloned into the 
Hindi 1 1 and Sail sites of the deleted pBR322 to give 
pHESl 6 (FIGURE 3B) . This vector was digested with 
Hindi I I and bacterial alkaline phosphatase and was then 
used to accept flindlll fragments which would extend the 

20 crystal protein gene in the 3' direction as indicated 
by the dashed lines in FIGURE 4D. 

pBBS19---»gyfrensi on af> frh g -pRBSl-6 -grygt^ 
proteirn -qgng bv -addition -of fchg R tndnr g -f rgoment -of 
yESl. The Hindlll E fragment (FIGURE 3A) was purified 

25 and ligated into pHES16 at the flindlll site* The 

orientation of this fragment in pBES16 was determined 
by digestion with the appropriate restriction enzymes 
and by an analysis of antigenic peptides as assayed by 
immunoblotting. 

30 pBBS32r 23; 25; ~33~and -3Q«gxtgnsj-on of -frhg 

PHBSlrS -crystal: -prate tn -qgng to ~Tn5 -insgrfcion -points -hn 
the ~Hr ndill-B -f raomgnfr. Insertion mutants containing 
Tn5 (s££ Wong, et al. (1983); also ss& FIGURE 4C) in 
the flindlll-E fragment were first deleted by Xho i 
digestion and religation to remove UNA between the Tn5 
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segment nearest the 5 1 end of the crystal protein gene 
and the Xfrzl site in the flindlll-D fragment of pESl as 
shown in FIGURE 4C. The resulting Hfndin fragment 
containing the S'-proximal portion of the flindlll-E 
5 fragment of pESl was purified and cloned into the 

Hindi I I site of pHES16 (FIGURE 4D) . The orientation of 
the cloned flindlll fragment was determined by ihal-fiafcl 
digestion; the correct orientation yielded a Xho i- Pst i 
fragment common to all of these plasmids. pHES31 

10 contains the same insert as pHES32 but in the opposite 
orientation. 

PHES3*/ 33; r~3ft "and 'SB^extensran -of pHESlft 
crystal: -or otetn-Qgng to -gequencgd -gndpoi-nts- The £xilll 
C fragment of pESl was cloned into the Sma l site of 

15 M13mp8 in an orientation which put the end nearest the 
3 1 end of the crystal protein gene closest to the 
flindlll site of the polylinker system of the phage 
(FIGURE 4B) . New flindlll fragments containing 5 1 
proximal portions of the flindlll E fragment were 

20 produced by making deletions. Thus, the insert for 

pHES34 was generated by partial digestion with DNAse I 
in the presence of Mn ++ , followed by digestion with 
Sallr filling in with DNA polymerase I and religation. 
The inserts used for pHES35 and 36 were produced by 

25 partial digestion with £au3AI followed by digestion 
with BamHIf purification and religation. For pHES34 
and 36, the positions of the endpoints of the deletions 
were determined by sequencing. The Sau3AI site of the 
pHES35 endpoint was inferred from restriction enzyme 

30 digestion and comparison with the known sequence. 

After insertion into pHES16 r the orientations of the 
flindlll fragments were determined by £s£I digestion: a 
fragment common to all three plasmids , extending from 
the site in the vector to the site remaining from the 

35 mp8 linker system, was indicative of the correct 
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orientation. 

pHE538 was produced by cloning Xmnl fragment 
No. 5 (FIGURE 3A) into the Sm&I site of H13mp8 in an 
orientation such that the end nearest the 3' terminus 
5 of the crystal protein was nearest the Hindi I I site of 
the mp8 poly linker sequence. The position of the Xsml 
site was determined by sequencing. The resulting 
fliadlll fragment was inserted into the fljtndlll site of 
pHES16 and the orientation was determined as described 
10 above. 

P fl flpl7r and ^ ft^gyfrensi nn -nf -the PBES1S 

pry^irT -grr ^q+fT -wng -ra -welt dgf tn^d -ffndpornts. The 
deletion used to construct pHES35 regenerated the BamHI 
site of the mp8 poly linker system and this was used to 

15 make several additional deletions (FIGURE 4E) . pHES35 
was digested with Bell (243 bases from the 5' proximal 
Hindi I I site of the flindlll E fragment) and aamHI and 
re-circularized by ligation to form pHES37. pHES39 was 
formed by digesting pHES35 partially with the ficaRI* 

20 activity (Polisky, et al.r 1975) followed by complete 
digestion with BamHI and the DNA polymerase I-catalyzed 
fill-in reaction. Purified fragments of the correct 
size were re-circularized by ligation to form pHES39. 
pHES35 was digested partially with Saii3AI, completely 

25 with BamHI and the fragments of the appropriate size 
were purified and re-circularized by ligation to 
produce pHES40. The positions of the endpoints were 
determined by comparing the sizes of the resulting 
Hindlll fragments with the known sequence. 

30 pwBs-ii-. 3Q.«-»r;rvst:aT, -prni-ptn -genes 

nnflgr -tar; promoter "gmrfcrot . The Hdfil C fragment Of 
pESl (FIGURE 3 A) was purified; pUC13Cm was digested 
with maX and the 5' extentions of both the insert and 
the vector were filled in using DNA polymerase I and 

35 deoxyribonucleotides. Following ligation, pHES41 was 
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obtained as a transf ormant containing the Hd&I C 
fragment under the control of the lac, promoter. pH£S41 
was digested with both Bel l and BamHI and was then 
re-circularized by ligation to form pHES43. pHESSO was 
5 constructed by a three-way ligation of pOC13Cm which 
had been digested with Sail and j&aRI, the purified 
SaM - Bcl l fragment of pHES41 which contained the 5 1 
portion of the crystal protein gene, and the 
BamHI-ficaRI fragment of Ml3mp8CN3 which contained the 
10 final 74 codons and the transcriptional terminator of 
the crystal protein gene (FIGURE 5A) • Phage M13mp8CN3 
was constructed by cloning the EsuII-Mdel fragment 
containing the 3 1 end of the crystal protein gene into 
the Sma l site of M13mp8 following a DNA polymerase 
15 I-catalyzed fill-in reaction, in an orientation placing 
the crystal protein coding sequence closest to the 
BamHI site of the phage. The crystal protein gene 
reading frame at this Bam HI site is the same as at the 
Sell site at codon 645. M13mp8CN3 was also used as a 
20 sequencing template for the 3' end of the crystal 
protein gene. 

pHES4 A r -*a.--*8 .- *9"5t -and -52— »ff usrons -at "the 
10th or 50thr -cations . The alpha-peptide of laaZ in pUC8 
was fused in the same reading frame to the crystal 
25 protein at the 10th (pHES 44, 48 and 51) or the 50th 

(pHES 45, 49 and 52) codons. Partial digestion of pESl 
with Xmnl was followed by complete digestion with Xhfll 
and fragments of 5-5.3 Kb were purified. pOC8 was 
digested with BamHI, the 5' extensions were filled in 
30 with DNA polymerase I and deoxyribonucleotides and this 
preparation was then digested with Sail. The pUC8 and 
pESl -derived DNAs were then ligated (FIGURE 5B) . 
Plasraid DNAs from several colonies which hybridized 
with a crystal protein gene-specific probe (Kronstad, 
35 et al. 1983) were screened using restriction enzymes to 
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distinguish pHES44 from pHES45. Although both plasmids 
contained the internal ficaRI fragments of 580 and 732 
bp from this crystal protein gene, additional smaller 
fragments were also found in pH£S44 and pH£S45 f 
5 respectively; these fragments were from the crystal 
protein gene-pUC8 junction. The aamHI sites from the 
10th codon fusions of this series (pHES44, 48 and 51) 
are regenerated, while the 50th codon fusions of this 
series (pHES45, 49 and 52) have lost the BamHI sites. 

10 pHES48 and 49 are 3'-end deletion mutants of pHES44 and 
45, respectively (FIGURE SB). The Sma l- Bgi i fragments 
of the latter plasmids, containing the 5' portion of 
the crystal protein gene, were cloned into the Smal and 
QamHI sites of pUC8. The modified crystal protein 

15 polypeptides produced from these plasmids initiate at 
the Jtac translational start site and reenter the lac 
alpha-peptide sequence out of phase following the 645th 
codon of the crystal protein gene. Plasmids pHESSl and 
52 were made by mixing the same Smal- EcoR i fragments 

20 used for pHES 48 and 49 above with Smal-fiaaRI cut 
pUC13Cm and the ficaRI-fiamHI fragment from M13rap8CN3 
used to make pHES50 (FIGURE 5B) • The fragments were 
ligated and transformed into JM83; white transf ormants 
growing on MacConJcey agar were screened to determine if 

25 they contained plasmids with the predicted restriction 
enzyme digestion products. pHESSl and 52 contain a 19 
amino acid long N-terminal coding sequence from 
beta-galactosidase and the pUC8 linker sequence fused 
to the crystal protein gene at the 10th and 50th 

30 codons, respectively, and are fused, in phase, to the 
lag alpha-peptide following the 645th codon of the 
crystal protein gene at the 3' end. 

PflgS4g"gnd-47-- j lQ and 'SOtti codon fusions 
terminating -irn Tn5 inserts. These plasmids were formed 

35 in a manner analogous to that of pHES44 and 45 above 
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except that the Xaul-Xlxal partial digest fragment were 
from the Tn5 insertion deletion mutant pESl-B22-l 
(FIGURE 5C) . The Tn5 insertion mutant used (B22, 
FIGURE 4 A) was the same as that used to make pHES33. 
5 EXAMPLE I 

DMA -sequence -of -a -crystal' -profcgtn ggng-and the 
dgducgd -amino acrd -sgqugncg -at -itrs -qgrrg -products 

The Baci-ltug thurrngi-gnsis crystal protein 
gene has been partially sequenced (Wong, et al., 1983). 

10 However f since more than this partial sequence was 
required to determine which portion of the crystal 
protein gene coded for the "toxin" portion of the 
crystal protein f the DNA sequence for the remainder of 
the gene was determined, partly with cloned restriction 

15 fragments from the restriction sites shown in FIGURE 1. 
The Pvu ii fragment from bases 1845 to 3831 in FIGURE IB 
was sequenced primarily by the DNAse I deletion method 
of Hong (1982) (&££ numbered sites in FIGURE IB) but 
some gaps in the coding strand sequence were filled in 

20 by obtaining deletions through partial Sau3AI digestion 
(asterisks in FIGURE IB) • The complete sequence was 
determined for both strands except for occasional 
anomalies of one to a few bases on one or the other 
strand* Since the coding sequence was thought to end 

25 200-400 bases distal to the EzuII site at base 3831 

(Wong, et al. , 1983), the determinations were extended 
to base 4222 just beyond a sequence resembling an 
gschgrrcftta coir transcription terminator (Rosenberg 
and Court, 1979) . The latter region was verified by SI 

30 nuclease mapping (discussed below) as the site of 
transcriptional termination. 

As shown earlier (Wong, et al. r 1983), the 
crystal protein gene is transcribed in Bacillus 
t-Hurirrgtgnsis during sporulation from two adjacent 

35 start sites (Bt I and Bt II in FIGURE 2A) . In 
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recombinant Escherichia QQll strains containing the 
cloned gene, transcription begins at a site (Ec in 
FIGURE 2A) located between Bt I and Btll and is 
independent of the phase of growth. The base sequences 
5 of the -10 and -35 regions of Bt I and Bt II differ 
significantly from the consensus sequences recognized 
by RNA polymerase from vegetative Bacillus subtilt^ and 
by EgCtrertcfrta salt RNA polymerase. Klier, et al., 
(1978) reported the isolation of two modified forms of 

10 RNA polymerase from sporulating cultures of Bacillus 
fchirrtrrgi-gngi'st one of these polymerases was found to 
transcribe in ££££2 a crystal protein gene cloned from 
the Chromosome Of Baci-lrlgg fchurtrrgrgnsrs subsp. 
fchirri-ngi-gnstg (Klier, et al., 1983). Interestingly, 

15 the transcribed sequence, which is nearly identical to 
a fl±ndIII/E£aRI fragment of the gene we have cloned 
(FIGURES 2A and 2B, bases 2100-2215) , does not contain 
-10 and -35 sequences corresponding to the sequence 
upstream from either the Btl or Btll start: sites. 

20 The complete sequence of the Bagrl-lm^ 

thurtrrgtensts crystal protein gene is shown in FIGURES 
2A and 2B« FIGURES 2A and 2B also show the amino acid 
sequence deduced from the only extended open reading 
frame in the DNA sequence. The open reading frame 

25 codes for 1176 amino acids specifying a polypeptide 
with a calculated molecular weight of 133,500* This 
value agrees well with molecular weights of 134,000 and 
136,000, respectively, determined for the protoxins 
from Bagtitirg thurtngirgnsts subsp. kiirstafci: (Bulla, et 

30 al., 1981) and subsp. thurtnglensig (Huber, et al., 
1981) . The deduced amino acid composition is very 
similar to the chemically determined amino acid 
compositions of either whole crystals or the purified 
protoxin of Bacillus thuringienstg subsp. tostafci (&£& 

35 Table 1) . 
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EXAMPLE II 

The J^roxtTngr-Dortton-of-the-eryflfral- protein 

gene -encodes "g tox tc -pgpttde . several investigators 
have proposed that the ca. M r =135,000 crystal protein 
5 is a protoxin which is proteolytically processed to 
yield smaller toxic fragments (Huber and Luthy, 1981; 
Bulla f et al., 1979; Lilley, et al., 1980 and 
Chestukhina, et al./ 1982). In addition, our earlier 
immunoblot analysis of Tn5 insertion mutants of pESl 

10 (Sfifi Wong f et al., 1983) revealed that a crystal 

protein antigen of ca. M r 68,000 was correlated with 
the S'-proximal segment of the crystal protein gene. K 
It was important, therefore, to determine whether this 
segment of the gene encoded a toxic peptide and, if so, 

15 to delineate the minimum portion of the gene which 
could produce such toxic fragments. To that end a 
series of pESl-type plasmids were constructed that 
contained insertions and deletions in the crystal 
protein gene. These plasmids were used to transform 

20 Escherichia £ol± host and then the resultant "crystal 
proteins" were analyzed for toxicity. 

The plasmid used to study 3 1 end deletions of 
the crystal protein gene was pHES16 (FIGURE 3B) . This 
plasmid consists of the XJxal-iLtndlll fragment of a Tn5 

25 insertion mutant, pESl-B8 (crosshatched bar in FIGURE 
3B) , which includes 485 bp of Tn5 and a portion of the 
crystal protein gene containing the promoter and the 
first 565 codons, placed in a modified pBR322 ( see 
FIGURE 3 and the Materials and Methods section, aiEUf 

30 for details of this and ensuing plasmid constructions) . 
Escherichia coli strains carrying this plasmid produced 
a ca. M r 58,000 crystal protein antigen and were not 
toxic to caterpillars (FIGURE 6A, and lane 8 of FIGURE 
6B) . 

35 To determine if extension of this crystal 
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protein gene could restore toxicity, the fliadlll-B 
fragment of pESl (FIGURE 3A) was inserted into the 
Hindlll site of pHES16. When the Kindlll-E fragment 
was inserted in the proper orientation, the recombinant 
5 Escherichia colt strain bearing the resulting pi a sin id, 
PHES19, was toxic to caterpillars (FIGURE 6A) . 
Escherichia qq!£ cells carrying pHES19 produced several 
polypeptides which reacted with antibodies to the 
crystal protein, the most prominent of which were ca. 

10 M r =104,000 and 70,000 (lane 2 of FIGURE 6B) . 

In general, throughout these experiments, it 
was observed that when substantial amounts of crystal 
protein antigen were detected, an array of antigenic 
polypeptides was seen (e.g., lanes 1-6 of FIGURE 6B) 

15 whether the entire crystal protein (lane 1 of FIGURE 
6B) or truncated derivatives (lanes 2-6 of FIGURE 6B) 
were synthesized. These polypeptides were distinct 
from smaller cross-reactive peptides produced by 
Escherichia cqli carrying pBR322 (lane 10 of FIGURE 

20 6B) . We presume that the multiple peptides originating 
from the crystal protein gene were produced either 
cotranslationally or post-translationally by 
proteolysis in Escherichia colt* 

EXAMPLE III 

25 Delineation 'Qt "the ~end-gf 'the 

t,qxirt-encaa±nq -segment . The experiments outlined in 
Example II demonstrated that the 5 '-proximal portion of 
the crystal protein gene encodes a toxic peptide and 
that the 3' end of the toxin-encoding portion was in 

30 the fllndlll-E fragment. To determine the 3* end of 
this region more precisely, a number of deleted 
plasmids were constructed. Ss& FIGURE 6A. The crystal 
protein encoding sequence in these plasmids terminated 
either in Tn5 f where all reading frames close within 30 

35 bases of the end of the transposon (boxes in FIGURE 6A; 
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Auerswald, et al., 1981) or in pBR322 f where all 
reading frames close within 2-17 codons (no boxes in 
FIGURE 6A; Sutcliff, 1979). FIGURE 6A also indicates 
the results of toxicity tests of extracts of these 
5 recombinant strains performed on the tobacco hornworm. 

As indicated in FIGURE 6A, four of the deleted 
plasmids having the crystal protein gene terminated by 
Tn5 (pHES32, 23 f 25 and 33) conferred toxicity to 
Egchgrtchta qq1± while one (pHES30) did not. Of the 

10 nine deleted plasmids with crystal protein gene 
sequences terminating in pBR322, five conferred 
toxicity (pHES19, 34 r 35, 36, 37) and four (pHES38, 39, 
40 and 16) did not. Plasmid pHES38 contains 603 codons 
of the crystal protein gene, which is the longest 

15 non-toxic segment of the gene tested, while plasmid 

pHES37 contains the shortest toxic segment: 645 codons 
of the crystal protein gene, 5 codons from the M13mp8 
linker sequence and 2 codons from pBR322. 

Panels B and C of FIGURE 6 show an immunoblot 

20 assay of the polypeptides produced by recombinant 

strains containing these plasmids. As shown in lanes 
2-6 of FIGURE 6B and lanes 2-5 of FIGURE 6C, deleted 
plasmids which directed synthesis of crystal protein 
fragments of 645 codons or longer (and made Escherichia 

25 colt toxic to caterpillars) synthesized crystal protein 
antigens of ca. M r 70,000 and an additional longer 
peptide which had a size roughly proportional to the 
length of the segment of the crystal protein gene 
beyond codon 645. Some additional fainter bands were 

30 also seen, as mentioned above. The full-length crystal 
protein as encoded by pESl-B8, is shown in lane 1 of 
FIGURE 6B and 6C for comparison. Escherichia qqXI 
strains containing deleted plasmids having less than 
645 codons of the crystal protein gene (lanes 7-9 of 

35 FIGURE 6B and lanes 6-8 of FIGURE 6C) were not toxic to 
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caterpillars and produced crystal protein antigens of 
ca. M r 58,000-60,000. The antigenic polypeptide in 
lane 9 of FIGURE 3B was produced from pHES31 which 
contains the flindlll insert of pHES32 in the opposite 
5 orientation; extracts of the strain carrying this 
plasmid were not toxic* In most cases (lanes 7-9 of 
FIGURE 6B and lanes 7 and 8 of FIGURE 6C) , the ca. M r 
58 f 000 polypeptide produced by the nontoxic strains was 
detected poorly by the immunoblotting analysis* This 
10 may be due to an increased susceptibility to 

proteolysis or might indicate the loss of a major 
antigenic determinant of the crystal protein. 

EXAMPLE IV 

5 » and -3*»~madif tcartiong-of' -the -crystal --protein 

15 sens. Knowing that the 5 '-proximal portion of the 

crystal protein gene encodes the toxic peptide and that 
the 3' end of the "toxin-encoding" gene segment is 
located between codons 603 and 645 (s££ FIGURE 2B) more 
plasmids containing altered crystal protein genes 

2 0 (under control of either the lafi promoter , or the iac 
promoter plus the beta-galactosidase translational 
initiation site) were constructed to assess the effect 
of additional changes at the 5' and 3' ends of the gene 
on toxicity. These plasmids are diagrammed in FIGURE 

25 7A; their construction is described in Materials and 
Methods section/ £up£2. 

For this study the 5 1 end modifications to the 
crystal protein gene were of two types, and were 
located at two places in the gene: 1} the first 10 * 

30 codons of the crystal protein gene were replaced with 9 

codons (pHES44, 46 and 48) or 19 codons (pHESSl) of the > 
beta-galactosidase and polylinker system of pUC8 and 
pUC13, respectively r or 2) the first 50 codons of the 
crystal protein gene were replaced by the first 9 

35 (pHES45, 47 and 49) or 19 (pHES52) codons of the 
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beta-galactosidase and polylinker system of pOC8 and 
pDC13 f respectively. Plasmids pHES41, 43 and 50 were 
under the transcriptional control of the lac promoter 
but were under translational control of the crystal 
5 protein ribosome binding site and served as controls 
for the effects of the 5 '-end modifications: 1) the 
crystal protein coding sequencer stop codon, and 
transcriptional terminator (pHES41, 44 and 45), 2) 
codon 645 of the crystal protein gene fused either in 

10 phase (pHES43) or out of phase (pHES48 and 49) with the 
reading frame of the subsequent portion of the lac 
alpha-peptide, 3) the Tn5 insertion site B22 (FIGURE 
3B) in pESl (pHES46 and 47) , 4) codon 645 of the 
crystal protein gene re-connected in phase to the final 

15 74 codons and the transcriptional terminator of the 
crystal protein gene (pHESSO, 51 and 52) . 

Extracts of Escherichia qqJl± containing these 
plasmids were tested for toxicity to hatchling 
caterpillars. Our results indicate that plasmids 

20 having the translational start site of the crystal 
protein gene or the beta-galactosidase fusion to the 
10th codon of the crystal protein gene were toxic 
regardless of the modification to the 3' end of the 
coding sequence. The beta-galactosidase fusions to the 

25 50th codon of the crystal protein gene were all 
non-toxic r irrespective of the 3" -terminal coding 
sequence. 

EXAMPLE V 

Ef f gcfcs ~o£ -the -5 *--»and 3 ^end -a Iterations . 
30 This example demonstrates that alterations of the 

crystal protein at the 5 1 end or at both ends affected 
production of the crystal protein antigen. 

As shown in lanes 1-3 of FIGURE 7B r 
alterations to the 5 f end of the whole crystal protein 
35 gene allowed production of a ca. M r 135,000 antigenic 
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peptide. While the presence of the crystal protein 
N-terminus (pHES41, lane 1) allowed production of 
several minor apparent cleavage products in addition to 
the full-sized crystal protein, strains containing 
5 fusions at the 10th codon (pHES44, lane 2) and 

especially at the 50th codon (pHES45 f lane 3) produced 
one (or more) prominent apparent cleavage product of 
ca. H r 110,000. Lanes 4-6 of FIGURE 7B show the 
crystal protein antigens produced by bacteria 

10 containing pHES43, 46 and 47 , respectively. The 

proteins produced from these plasmids had sizes of ca. 
M r 77,000, 85,000 and 80,000, respectively, in accord 
with the size of the coding sequence present in these 
plasmids. In addition, lanes 4 and 5 (the crystal 

15 protein N-terminus and the 10th codon fusion, 

respectively) contained polypeptides in the ca. M r 
70 f 000 range, while no polypeptide of this size was 
seen in lane 6 (the 50th codon fusion) • 

The results of an immunoblot. assay of extracts 

20 of Escherichia qqXI carrying plasmids with additional 
3' -end alterations following codon 645 of the crystal 
protein gene are shown in FIGURE 7C. As noted above, 
the natural N-terminus or the 10th codon fusion allowed 
processing of some of the crystal protein antigen to a 

25 ca. M r 70,000 form. Lanes 1-4 of FIGURE 7C show that 
this was true whether the 3' sequence following residue 
645 was the beta-galactosidase alpha-peptide, either in 
phase (pHES43, lane 1) or out of phase (pHES48, lane 3) 
or the final 74 codons of the crystal protein gene 

30 (pHESSO, lane 2; pBESSl, lane 4). If the peptide 
synthesized in the strain containing the 50th codon 
fusion were processed when the 3'-end sequence was the 
out-of -phase lac alpha-peptide, it would be expected 
that lane 5 would show two bands, each ca. M r 4,000 

35 smaller than the two bands of ca. M r 70,000 and 
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74,000 seen in lane 3. Only the larger of these two 
expected bands was detected. Lane 6 of FIGURE 7C shows 
that some cleavage of the 50th codon fusion protein to 
a ca* M r 70 , 000 peptide took place when the final 74 
5 codons of the crystal protein gene were present 

following codon 645. FIGURE 7C also shows that when 
the final 74 codons of the crystal protein gene were 
fused in phase following codon 645, the same prominent 
peptide of ca. M r 50,000 was detected in extracts of 

10 strains having different 5 1 ends (lanes 2, 4 and 6 of 
FIGURE 7C) , suggesting that sequences distal to residue 
645 can influence the overall conformation of the 
altered polypeptide. Since this common cleavage 
product was found in extracts of a strain carrying the 

15 non-toxic plasmid pHES52, it seems likely that the ca. 
M r 50,000 peptide would be non-toxic. 

The immunoblot analysis in FIGURES 6B, 6C and 
7C indicates that when the ca. M r 70,000 crystal 
protein fragment was produced, the amount of antigen 

20 detected was affected by the sequences distal to the 
crystal protein coding sequences. As shown in lane 2 
of FIGURE 6B and lanes 2-5 of FIGURE 6C, the amount of 
antigenic material synthesized by strains bearing 
plasmids with crystal protein sequences fused at the 

25 3'-end to pBR322 was greater than when the crystal 
protein sequences terminated in Tn5 (lanes 3-6 of 
FIGURE 6B) • It seems unlikely that this difference was 
due to the loss of a particularly immunogenic segment 
of the protoxin since both longer and shorter 

30 pBR322-terminated plasmids promoted the increased 

synthesis of the antigen when compared to the amount 
made when Tn5 was at the 3 f end. Preliminary estimates 
of the rate of protein degradation in these strains, 
which were made by performing immunoblot analyses on 

35 extracts of cultures at various times after protein 
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synthesis was blocked by the addition of 
chloramphenicol, indicated that the rate of degradation 
of the altered crystal protein was not markedly 
different in strains having pBR322 or Tn5 sequences 
5 distal to the crystal protein gene coding sequences 

(data not shown). As shown in FIGURE 7C, more antigen 
was produced when the 3* end of the crystal protein 
gene was present (lanes 2, 4 and 6) than when 
transcription terminated in the pUC vectors (lanes 1, 3 
10 and 5 of FIGURE 7C) . 

XablfiS 
Table I 
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Thus it can be seen that the present invention 
discloses a toxin-encoding segment of a Bacirilug 
tfturlngtensts crystal protein gene that is expressible 
5 in recombinant host organisms. The present invention 
further discloses that the "toxin" protein product 
produced by these transformed recombinant strains is 
toxic to lepidopteran insects. Since the toxin peptide 
is approximately half the size of the Bacillus 

10 tfturingi'ffnsts crystal protein protoxin peptide , 
standard insecticidal preparations containing the 
protoxin crystals can be made to be twice as effective 
per given dose (or equally as effective at half the 
dose) by utilizing the smaller toxin fragment instead 

15 of the larger protoxin polypeptide. 

Various modifications of the invention in 
addition to those shown and described herein will 
become apparent to those skilled in the art from the 
foregoing description and accompanying figures. Such 

20 modifications are intended to fall within the scope of 
the appended claims. 
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WHAT IS CLAIMED IS: 

1. A DNA fragment encoding the toxin portion 
of a Bactitgg tfturtngiensts crystal protein peptide 
wherein said DNA fragment is a toxin-encoding DNA 
sequence having substantial sequence homology with the 

5 toxin-encoding DNA sequence beginning with base pair 
527 and ending with base pair 2461 as shown in FIGURES 
2 A and 2B* 

2. A composition of matter comprising the 
amino terminal 55% of a Bacillus thtrrtngt-gngi-^ crystal 

10 protein gene as shown in FIGURES 2A and 2B. 

3. A DNA sequence having substantial sequence 
homology with the DNA sequence comprising codons 1 
through 645 (base pairs 527 through 2461) as shown in 
FIGURES 2 A and 2B. 

15 4. A DNA sequence according to Claim 3 

wherein the sequence coding for codons 1 through 10 is 
replaced by a DNA sequence that codes for an amino 
terminus (N- terminus) derived from substance X. 

5. A DNA sequence according to Claim 4 

20 wherein substance X is comprised of beta-galactosidase 
and a synthetic linker. 

6* A DNA sequence having substantial sequence 
homology with a first DNA sequence comprised of base 
pairs 527 through 2461 (codons 1 through 645) as shown 

25 in FIGURES 2 A and 2B, and a second DNA sequence 
comprised of base pairs coding for the carboxyl 
terminus of beta-galactosidase wherein said first and 
said second DNA sequences are linked by a synthetic 
linker. 

30 7. A DNA sequence having substantial sequence 

homology with a DNA sequence comprised of a first DNA 
sequence comprised of base pairs 527 through 2461 
(codons 1 through 645} , a second DNA sequence comprised 
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of base pairs 3833 through 4054 (codons 1103 through 
1176) and a third DNA sequence comprised of base pairs 
4140 through 4185 (the transcriptional terminator) as 
shown in FIGURES 2A and 2B* 
5 8, A recombinant microbial strain transformed 

to express a polypeptide of ca. 73 , 000 M r having the 
immunological properties of the toxin portion of the 
crystal protein of Bactllag ^urin ? ^ngt g . 

9. A recombinant microbial strain according 
10 to Claim 8 wherein said strain is selected from the 

group comprised of bacteria and yeast* 

10. A recombinant bacterial strain 
transformed to express a polypeptide of Ca* 73,000 M r 
having the immunological properties of the toxin 

15 portion of the crystal protein of Bgci3-lug 
fchuringigngig . 

11* A recombinant bacterial strain according 
to Claim 10 wherein said bacteria are selected from the 
group comprised of Escherichia coltr Bacillus auhttlis, 

20 and bacteria that are endogenous to green plants* 

12. A recombinant Escherichia £gli bacterial 
strain transformed to express a polypeptide of ca. 
73 f 000 M r having the immunological properties of the 
toxin portion of the crystal protein of Bgciiiug 

25 tfturtngtenstg* 

13* A hybrid recombinant plasmid capable of 
replication in a bacterial host species, said plasmid 
containing expressible heterologous DNA coding for a 
polypeptide of ca. 73,000 M r which has the 

30 immunological and toxic properties of the toxin portion 
of crystal protein of Ba-cititrs thuringignsis . said 
plasmid further including an expression mechanism for 
said heterologous DNA which is recognized by the host 
species 1 system* 

35 14. A hybrid recombinant plasmid capable of 
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replication in an Escfrgri-ehla cat± bacterial host 
species , said plasmid containing expressible 
heterologous DNA coding for a polypeptide of ca. 73,000 
M r which has the immunological and toxic properties of 
5 the toxin portion of crystal protein of Bagirius 
fchnrtrrgi-gngi-g. said plasmid further including an 
expression mechanism for said heterologous DNA which is 
recognized by the host species 1 system. 

15. A method for producing the toxin portion 

10 of a Bacillgs thuringiensts crystal protein peptide 
comprising utilizing a recombinant plasmid which 
contains the amino terminal 55% of a Bacirrusy 
tfturtTTgfffnsfg crystal protein gene as shown in FIGURES 
2A and 2B to transform microbial host organisms. 

15 16. A method for producing the toxin portion 

of a Bacillus thurrnoignsfs crystal protein peptide 
comprising utilizing a recombinant plasmid which 
contains a DMA fragment having substantial sequence 
homology with the toxin-encoding DNA sequence beginning 

20 with base pair 527 and ending with base pair 2461 as 
shown in FIGURES 2A and 2B to transform microbial host 
organisms. 

17. A method for producing a peptide that 
contains the toxin portion of a Baci-itus tfturingi-gnsis 

25 crystal protein peptide comprising utilizing a 

recombinant plasmid which contains a DNA sequence 
having substantial sequence homology with a DNA 
sequence comprised of a first DNA sequence comprised of 
base pairs 527 through 2461 (codons 1 through 645) , a 

30 second DNA sequence comprised of base pairs 3833 

through 4054 (codons 1103 through 1176) and a third DNA 
sequence comprised of base pairs 4140 through 4185 (the 
transcriptional terminator) as shown in FIGURES 2A and 
2B to transform microbial host organisms. 

35 18. Toxin peptides produced by recombinant 
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host strains transformed to express the DNA sequences 
claimed in any of Claims 1 through 7, 

19. Toxin peptides produced by the 
recombinant host strains claimed in any of Claims 8 

5 through 12. 

20. Toxin peptides produced by recombinant 
host strains transformed by the recombinant plasmids 
claimed in any of Claims 13 and 14. 

21. A method for increasing the amount of 
10 toxin peptide produced by a recombinant strain 

comprising utilizing a recombinant plasmid containing 
the DNA sequence of Claim 7 to transform microbial host 
organisms. 

22* A plant transformed to express a 

15 polypeptide of ca. 73 , 000 M r having the immunological 
properties of the toxin portion of the crystal protein 
of Bacillus tfturingrenstSr wherein said plant is 
transformed by a vector containing an expressible DNA 
fragment having substantial sequence homology with the 

20 toxin-encoding DNA sequence beginning with base pair 
527 and ending with base pair 2461 as shown in FIGURES 
2 A and 2B. 

23. A plant transformed to express a 
polypeptide of ca. 73 f 000 M r having the immunological 

25 properties of the toxin portion of the crystal protein 
of Bacillus thirrtngtgnsi-s wherein said plant is 
transformed by a vector containing an expressible gene 
portion comprised of the amino terminal 55% of a 
Bacillus thurtngirensi-s crystal protein gene as shown in 

30 FIGURES 2 A and 2B. 
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FIG. I-B 
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